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Single-subject  research  involves  the  comparison  of  an  individual’s  behavior  under 
experimental  and  control  conditions.  When  the  experimental  manipulation  involves  the 
introduction  of  a reinforcement  contingency,  an  appropriate  control  condition  involves 
the  elimination  of  this  contingency.  Several  control  techniques  have  been  used  in  applied 
research,  including  extinction,  noncontingent  reinforcement  (NCR),  and  differential 
reinforcement  of  other  behavior  (DRO);  however,  very  little  research  has  examined  the 
relative  strengths  and  weaknesses  of  these  techniques.  The  purpose  of  the  current  study 
was  to  compare  the  effects  of  extinction,  NCR,  and  DRO  with  respect  to  (a)  speed  of 
response  decrement,  (b)  amount  of  response  decrement,  and  (c)  any  positive  or  negative 
side  effects  associated  with  the  procedures.  Control  procedures  were  evaluated  using 
both  multielement  and  reversal  designs.  Results  indicated  that  extinction  generally 
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produced  the  most  consistent  and  rapid  reversal  effects  with  few  observed  negative  side 
effects. 
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INTRODUCTION 


A defining  feature  of  applied  behavior  analysis  is  the  use  of  single-subject  designs 
to  demonstrate  functional  relations  between  independent  and  dependent  variables  (Baer, 
Wolf,  & Risley,  1968).  When  a single-subject  design  is  used,  each  participant  is  exposed 
to  at  least  one  experimental  condition  in  which  an  independent  variable  is  introduced,  and 
at  least  one  control  in  which  the  independent  variable  is  absent.  When  the  independent 
variable  involves  the  introduction  of  a reinforcement  contingency,  the  appropriate  control 
condition  involves  the  elimination  of  this  contingency.  A functional  relation  (i.e., 
experimental  control)  is  demonstrated  if  rates  of  behavior  are  consistently  higher  when 
the  reinforcement  contingency  is  present  than  when  it  is  absent. 

The  purpose  of  this  study  was  to  compare  the  effects  of  three  control  procedures 
that  have  been  used  in  applied  research  on  the  effects  of  positive  reinforcement.  The 
control  procedure  used  most  commonly  is  the  “traditional  extinction”  control  condition, 
in  which  the  reinforcement  contingency  is  eliminated  by  completely  terminating  the 
delivery  of  reinforcement  (e.g.,  Craig  & Holland,  1970;  Hopkins,  Schutte,  & Garton, 
1971).  An  alternative  to  the  traditional  extinction  procedure  is  the  “noncontingent 
reinforcement”  (NCR)  control,  in  which  the  contingency  between  the  target  response  and 
reinforcer  presentation  is  eliminated  because  reinforcers  are  delivered  independent  of  the 
target  response  (Hart,  Reynolds,  Baer,  Brawley,  & Harris,  1968;  Lattal,  1969).  Finally, 
“differential  reinforcement  of  other  behavior”  (DRO)  schedules  have  been  used  as 
controls  for  positive  reinforcement.  With  a DRO  schedule,  the  contingency  between  the 
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target  response  and  reinforcer  delivery  is  reversed  such  that  reinforcers  are  delivered 
contingent  upon  the  absence  of  the  target  response  (e.g.,  Baer,  Peterson,  & Sherman, 
1967;  Reynolds  & Risley,  1968). 

Given  that  a number  of  control  procedures  have  been  used  to  demonstrate  the 
effects  of  positive  reinforcement,  it  may  be  helpful  to  discuss  some  factors  that  should  be 
considered  by  researchers  when  selecting  a control  procedure  for  a particular 
investigation.  From  a methodological  standpoint,  researchers  should  attempt  to  choose 
the  condition  that  contributes  to  the  most  convincing  demonstration  of  experimental 
control.  However,  methodological  considerations  must  be  balanced  with  the  practical 
concerns  of  applied  researchers.  Specifically,  applied  researchers  must  also  consider  the 
extent  to  which  a particular  control  condition  is  likely  to  produce  the  desired  effect 
quickly  with  minimal  negative  side  effects. 

Methodological  Appropriateness  of  Procedures 

According  to  Rescorla  (1967),  the  most  convincing  demonstration  of 
experimental  control  can  be  achieved  when  the  control  condition  contains  all  features  of 
the  experimental  condition,  while  eliminating  only  the  independent  variable.  If  the 
experimental  condition  contains  features  that  are  not  present  in  the  control  condition, 
behavior  change  cannot  be  attributed  to  the  independent  variable.  The  typical  positive 
reinforcement  arrangement  involves  two  key  features:  a stimulus  is  presented,  and  a 
contingency  is  arranged  between  the  performance  of  the  target  response  and  the  delivery 
of  that  stimulus.  Thus,  in  order  to  demonstrate  that  a behavior  change  is  produced  by  the 
reinforcement  contingency,  the  experiment  must  isolate  not  only  the  effects  of 
reinforcement  from  those  of  other  potentially  confounding  variables,  but  also  the  effects 
of  the  contingency  from  those  of  mere  stimulus  presentation. 
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When  the  traditional  extinction  control  is  used,  both  features  of  the  experimental 
condition  are  absent.  That  is,  the  traditional  extinction  condition  eliminates  stimulus 
presentations  as  well  as  the  contingency  between  the  target  response  and  delivery  of  the 
stimulus.  For  example,  Bailey  and  Meyerson  (1969)  used  vibration  as  reinforcement  to 
increase  lever  pressing  in  a young  boy.  A lever  was  mounted  on  the  side  of  the  boy’s 
crib,  and  during  reinforcement  conditions,  lever  presses  produced  access  to  vibration 
according  to  a continuous  (CRT)  schedule.  Rates  of  lever  pressing  in  this  reinforcement 
condition  were  compared  with  those  observed  in  an  extinction  control  condition  in  which 
vibration  was  discontinued.  Thus,  the  experimental  and  control  conditions  differed  with 
respect  to  both  the  delivery  of  vibration  and  the  presence  of  a contingency  between 
vibration  and  lever  pressing. 

Similarly,  Rheingold,  Gewirtz,  and  Ross  (1959)  compared  levels  of  infant 
vocalization  in  a social  reinforcement  condition  with  those  in  a traditional  extinction 
control  condition  in  which  no  social  stimulation  (e.g.,  smiling,  physical  interaction)  was 
presented.  The  authors  noted  that  the  use  of  the  traditional  extinction  control  condition 
limited  the  extent  to  which  increases  in  infant  vocalization  observed  in  the  reinforcement 
condition  could  be  attributed  to  the  reinforcement  contingency.  That  is,  because  no 
social  stimulation  was  presented  during  the  control  condition,  the  experimental 
arrangement  did  not  rule  out  the  possibility  that  infant  vocalizations  were  elicited  by  the 
mere  presentation  of  social  stimulation. 

As  illustrated  in  the  above  studies,  the  main  methodological  limitation  of  the 
traditional  extinction  control  condition  is  that  it  does  not  isolate  the  effects  of  the 
reinforcement  contingency  from  those  of  mere  stimulus  presentation.  To  convincingly 
demonstrate  that  a reinforcement  contingency  is  solely  responsible  for  behavior  change,  a 
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reinforcement  condition  should  be  compared  with  a control  condition  in  which  the 
reinforcement  contingency  is  eliminated,  but  the  reinforcing  stimulus  continues  to  be 
presented.  Thus,  the  NCR  control  condition  is  considered  to  be  the  most 
methodologically  appropriate  procedure  for  demonstrating  the  effects  of  positive 
reinforcement  (Rescorla  & Skucy,  1969). 

The  NCR  control  strategy  was  used  by  Buell,  Stoddard,  Harris,  and  Baer  (1968) 
to  evaluate  the  effects  of  social  reinforcement  on  a child’s  use  of  playground  equipment. 
During  the  reinforcement  condition,  teachers  provided  continuous  attention  whenever  the 
child  used  the  equipment.  Behavior  in  this  condition  was  compared  with  behavior 
observed  during  NCR  sessions  in  which  teacher  attention  was  provided  independent  of 
whether  the  participant  used  the  playground  equipment.  The  participant  used  the 
equipment  during  a higher  percentage  of  intervals  during  the  reinforcement  condition, 
demonstrating  that  the  reinforcement  contingency  was  responsible  for  the  behavior 
change. 

Bushell,  Wrobel,  and  Michaelis  (1968)  used  an  NCR  control  condition  in  a study 
evaluating  the  effects  of  token  reinforcement  on  the  study  behavior  of  children  in  a 
preschool  class.  During  the  reinforcement  condition,  study  behavior  (e.g.,  task 
engagement,  reciting  information  from  a Spanish  lesson)  produced  tokens  that  were 
exchangeable  for  access  to  a special  event  (e.g.,  movie,  trip).  Levels  of  study  behavior  in 
this  reinforcement  condition  were  compared  with  those  observed  in  an  NCR  condition  in 
which  tokens  continued  to  be  presented  contingent  upon  study  behavior,  but  special 
events  were  available  noncontingently.  The  authors  noted  that  this  control  condition 
allowed  them  to  isolate  the  effects  of  the  contingent  delivery  of  the  special  event  from  the 
effects  of  token  presentation  and  teacher  attention  related  to  token  presentation. 
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The  DRO  control  condition  also  involves  the  continued  presentation  of  stimuli 
delivered  during  the  reinforcement  condition,  and  therefore  might  be  considered  more 
methodologically  appropriate  than  the  traditional  extinction  condition.  However,  a 
methodological  limitation  of  the  DRO  control  is  that  the  original  reinforcement 
contingency  is  replaced  with  a new  contingency  that  was  not  present  in  the 
reinforcement  condition.  During  the  reinforcement  condition,  there  is  an  increased 
probability  of  reinforcement  given  the  target  response,  whereas  in  the  DRO  control,  there 
is  a decreased  (zero)  probability  of  reinforcement  given  the  target  response  because 
reinforcement  is  delivered  contingent  upon  the  absence  of  the  target  response.  Because 
the  DRO  control  introduces  a new  contingency  that  was  not  present  in  the  experimental 
condition,  DRO  might  be  considered  less  appropriate  than  the  NCR  control  as  an 
alternative  to  traditional  extinction. 

Brigham  and  Sherman  (1968)  used  a DRO  control  condition  in  a study  on  the 
effects  of  reinforcement  on  verbal  imitation  by  preschool  children.  In  the  reinforcement 
condition,  edible  items  or  tokens  were  delivered  contingent  upon  correct  imitation  of 
spoken  English  words.  Russian  words  were  also  presented;  however,  no  reinforcement 
was  delivered  for  correct  responses  to  these  stimuli.  During  a DRO  control  condition, 
reinforcement  was  delivered  contingent  upon  the  absence  of  imitation  (for  5 s)  following 
the  presentation  of  an  English  word,  and  no  reinforcement  was  provided  following  the 
presentation  of  Russian  words.  Children  engaged  in  consistently  higher  levels  of  correct 
imitation  of  both  English  and  Russian  words  during  the  reinforcement  condition  as 
compared  to  the  DRO  control  condition.  These  results  suggested  that  reinforcement 
contingent  upon  correct  imitation  increased  levels  of  correct  responding,  and  that  this 
effect  generalized  to  stimuli  (Russian  words)  that  were  not  correlated  with  reinforcement. 
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The  DRO  control  was  also  used  in  a study  by  Poulson  (1983)  that  evaluated  the 
effects  of  social  reinforcement  on  infant  vocalizations.  During  the  reinforcement 
condition,  infant  vocalization  resulted  in  social  stimulation  (e.g.,  talking  to  or  touching 
the  infant)  according  to  a CRF  schedule.  Levels  of  vocalization  in  this  condition  were 
compared  with  those  observed  in  a DRO  condition  in  which  social  stimulation  was 
presented  every  2 s,  but  was  delayed  for  4 s when  infant  vocalizations  occurred.  Higher 
levels  of  behavior  were  observed  in  the  reinforcement  condition  as  compared  to  the  DRO 
control  condition,  suggesting  that  infant  vocalizations  were  sensitive  to  the  reinforcement 
contingency.  This  study  represents  a methodological  improvement  over  previous  studies 
on  the  effects  of  social  reinforcement  on  infant  babbling  that  used  the  traditional 
extinction  control  and  failed  to  rule  out  the  alternative  interpretation  that  infant  vocal 
behavior  was  elicited  by  social  stimulation  (e.g.,  Rheingold  et  al.,  1959;  Todd  & Palmer, 
1968). 

To  summarize,  from  a methodological  standpoint,  NCR  is  considered  the  most 
appropriate  control  condition  for  demonstrating  the  effects  of  positive  reinforcement. 
Unlike  the  commonly  used  traditional  extinction  control,  the  NCR  control  isolates  the 
effects  of  the  reinforcement  contingency  from  the  effects  of  mere  stimulus  presentation, 
and,  unlike  the  DRO  control,  it  does  so  without  introducing  additional  variables. 

Practical  Considerations 

Aside  from  the  methodological  issues  described  above,  there  are  also  practical 
issues  that  researchers  may  wish  to  consider  when  choosing  a control  condition.  For 
example,  researchers  should  consider  the  extent  to  which  a particular  control  condition  is 
likely  to  contribute  to  an  efficient  demonstration  of  experimental  control.  The  most 
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efficient  demonstration  can  be  achieved  when  the  introduction  of  a control  condition 
results  in  a large  and  rapid  decrease  in  the  target  response. 

Studies  that  have  compared  traditional  extinction  with  NCR  and/or  DRO  have 
generally  found  that  extinction  produced  the  largest  and  most  rapid  reductions  in  the 
target  response.  For  example,  Uhl  and  Garcia  (1969)  found  that  traditional  extinction 
resulted  in  more  rapid  reductions  in  responding  than  did  DRO.  Similarly,  Rescorla  and 
Skucy  ( 1 969)  compared  the  effects  of  a traditional  extinction  condition  with  those 
obtained  with  both  NCR  and  DRO  schedules.  Results  indicated  that  all  three  of  the 
procedures  resulted  in  substantial  reductions  in  the  target  response;  however,  the 
extinction  condition  produced  the  largest  and  most  rapid  effects. 

Persistence  of  responding  during  NCR  conditions  has  been  observed  in  a number 
of  investigations  (e.g.,  Hermstein,  1966;  Neuringer,  1970;  Rescorla  & Skucy,  1969)  and 
has  often  been  attributed  to  the  accidental  reinforcement  of  the  target  response.  That  is, 
with  NCR,  stimuli  presented  independent  of  responding  may  nevertheless  be  delivered 
contiguous  with  the  target  response,  resulting  in  superstitious  maintenance  (Catania  & 
Keller,  1981;  Skinner,  1948).  Therefore,  although  NCR  is  the  most  methodologically 
advantageous  control  procedure,  a practical  limitation  of  the  procedure  is  the  possibility 
of  response  maintenance  due  to  adventitious  reinforcement. 

To  the  extent  that  persistence  of  responding  under  NCR  can  be  attributed  to  the 
presence  of  accidental  contiguities  between  the  target  response  and  reinforcer  delivery, 
DRO  may  represent  a practical  improvement  over  the  NCR  control.  That  is,  because 
reinforcers  are  delivered  contingent  upon  the  absence  of  the  target  response  during  DRO, 
there  is  no  opportunity  for  accidental  reinforcement.  Davis  and  Bitterman  (1971) 
compared  the  effects  of  DRO  and  NCR  with  rats  that  had  been  trained  on  a variable 
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interval  (VI)  - 30s  schedule  of  food  delivery  for  lever  pressing.  Results  showed  that 
DRO  produced  more  rapid  and  larger  decrements  in  responding  than  did  NCR. 

The  authors  attributed  persistence  of  responding  in  the  NCR  condition  to  adventitious 
reinforcement  because,  for  some  participants,  response  rates  increased  across  exposures 
to  the  NCR  schedule. 

Goetz,  Holmberg,  and  LeBlanc  (1975)  also  compared  DRO  and  NCR  as  control 
procedures  in  evaluating  a reinforcement  procedure  designed  to  increase  a young  girl’s 
compliance.  Reinforcement  consisted  of  teacher  proximity  contingent  upon  compliance 
with  requests.  The  level  of  compliance  observed  during  the  reinforcement  condition  was 
compared  with  the  level  of  compliance  during  two  control  conditions.  In  the  NCR 
control  condition,  teacher  proximity  was  presented  contingent  upon  both  compliance  and 
noncompliance;  during  the  DRO  control  condition,  teacher  proximity  was  presented 
contingent  upon  noncompliance.  Results  indicated  that  the  DRO  procedure  produced 
larger  and  more  rapid  decrements  in  responding  than  did  the  NCR  procedure.  It  is 
important  to  note  that  data  collected  on  independent  variable  implementation  suggested 
that  teachers  were  actually  reinforcing  compliance  during  the  NCR  condition;  therefore, 
the  conclusions  that  can  be  drawn  from  this  study  are  limited. 

Although  there  is  some  evidence  to  suggest  that  the  persistence  of  responding 
observed  with  NCR  schedules  can  be  attributed  to  accidental  reinforcement  (Davis  & 
Bitterman,  1971),  accidental  reinforcement  cannot  account  for  results  demonstrating  that 
traditional  extinction  reduces  behavior  more  quickly  than  DRO  (e.g.,  Rescorla  & Skucy, 
1960;  Uhl  & Garcia,  1969).  By  definition,  stimuli  are  never  delivered  contiguous  with 
responding  when  a DRO  schedule  is  implemented;  therefore,  there  is  no  opportunity  for 
accidental  reinforcement.  However,  some  researchers  have  suggested  that  the  mere 
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presentation  of  reinforcers  might  exert  some  stimulus  control  over  responding  even  when 
no  contingency  exists  (Uhl  & Garcia,  1 969).  If  the  presentation  of  stimuli  occasions 
responding  previously  reinforced  by  the  delivery  of  those  stimuli,  reductions  in 
responding  would  occur  more  slowly  with  DRO  than  with  extinction. 

In  addition  to  speed  and  amount  of  response  decrement  produced  by  the 
procedures,  it  might  also  be  important  to  consider  the  speed  with  which  responding  is 
recovered  following  exposure  to  a particular  control  condition.  Rapid  recovery  of  the 
response  under  reinforcement  conditions  also  contributes  to  an  efficient  demonstration  of 
control.  Goetz  et  al.  (1975)  examined  behavior  in  reinforcement  conditions  immediately 
following  exposure  to  NCR  and  DRO  control  conditions  and  found  no  differences  in  the 
speed  of  recovery;  however,  most  other  studies  have  not  addressed  this  issue. 

Researchers  should  also  consider  whether  a control  procedure  is  likely  to  produce 
negative  side  effects.  For  example,  the  traditional  extinction  condition  has  been 
associated  with  a number  of  undesirable  side  effects,  such  as  response  bursting  or 
emotional  responding  (see  Lerman  & Iwata,  1 996,  for  a review),  and  similar  negative 
side  effects  have  been  reported  when  DRO  has  been  used  to  reduce  problem  behavior 
(Cowdery,  Iwata,  & Pace,  1990;  Vollmer,  Iwata,  Zarcone,  Smith,  & Mazaleski,  1993). 

By  contrast,  NCR  has  been  reported  to  minimize  the  presence  of  side  effects  (Vollmer  et 
al.,  1998).  However,  because  the  occurrence  of  potential  negative  side  effects  of  these 
procedures  has  rarely  been  recorded,  additional  research  is  needed  to  determine  their 
prevalence.  For  example,  Lerman  and  Iwata  (1995)  examined  113  data  sets  and  found 
that  extinction  bursts  occurred  in  only  24%  of  cases,  suggesting  that  this  negative  side 
effect  may  be  less  prevalent  than  previously  assumed. 
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Practical  concerns  related  to  the  efficiency  of  the  control  procedures,  as  well  as 
side  effects  associated  with  the  procedures,  may  be  of  most  concern  in  applied  settings. 
Applied  researchers  should  attempt  to  identify  efficient  and  effective  solutions  to  the 
problems  with  which  they  are  presented.  In  addition,  applied  research  may  be  more 
welcomed  in  a variety  of  settings  to  the  extent  that  it  can  be  conducted  quickly  and  with 
little  disruption  to  ongoing  routine.  Therefore,  additional  research  is  needed  to  more 
clearly  describe  the  direct  and  indirect  effects  of  various  control  conditions  that  are  used 
frequently  by  applied  researchers. 

The  purpose  of  this  study  was  to  compare  extinction,  DRO,  and  NCR  as  control 
conditions  in  evaluating  the  effects  of  a reinforcement  contingency.  These  conditions 
were  compared  with  respect  to  (a)  rate  of  response  decrement,  (b)  amount  of  response 
decrement,  (c)  rate  of  response  recovery  during  subsequent  reinforcement  conditions,  and 
(d)  observed  side  effects  (e.g.,  emotional  responding,  aggression,  extinction  bursts). 


METHOD 


Participants  and  Setting 

Nine  individuals  with  developmental  disabilities  who  attended  a workshop 
program  participated.  Participant  information  is  presented  in  Table  1.  Sessions  were 
conducted  in  the  participants’  workshop  or  group  home  in  quiet  areas  that  contained 
tables,  chairs,  and  materials  needed  to  conduct  sessions  (see  below).  Sessions  were  10 
min  in  duration  and  were  conducted  2 to  4 times  per  day,  4 to  5 days  per  week. 


Table  1. 

Participant  Characteristics 


Name 

Age 

Diagnosis/Sensory  Impairments 

Kal 

47 

Profound  MR,  visual  impairment 

Amy 

39 

Severe  MR,  seizure  disorder 

Sven 

45 

Profound  MR,  autism 

Brad 

36 

Mild  MR 

Phyllis 

30 

Profound  MR 

Julie 

38 

Severe  MR,  Down’s  Syndrome 

Peg 

50 

Moderate  MR,  Prader-Willi  Syndrome 

Larry 

34 

Moderate  MR,  seizure  disorder,  hearing  impairment 

Lynn 

42 

Mild  MR,  seizure  disorder 
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Response  Measurement  and  Interobserver  Agreement 

Observers  recorded  the  frequency  of  several  events.  Target  behaviors  for 
individual  participants  included  the  following:  emitting  the  manual  (American  Sign 
Language)  sign  for  “please”  (Kal),  stair  stepping  (Amy  and  Larry),  pressing  a button  on  a 
voice-output  device  that  emitted  the  message  “snack  please”  (Sven),  toe  touching  (Brad), 
microswitch  pressing  (Phyllis,  Julie,  and  Pam),  and  silverware  sorting  (Lynn). 

Reinforcer  delivery  was  recorded  each  time  the  therapist  placed  an  edible  item  onto  the 
participant’s  plate  or  placed  a token  into  the  designated  container  (Peg  only).  Requests 
were  recorded  any  time  a participant  verbally  requested  (i.e.,  vocally  or  through  manual 
sign)  the  item,  pointed  at,  or  attempted  to  grab  the  item.  An  attempt  to  leave  the  session 
was  recorded  each  time  a participant  verbally  requested  to  leave  the  area  or  attempted  to 
leave  the  area  without  permission.  Negative  vocalizations  were  recorded  when 
participants  whined,  cried,  or  complained  about  sessions.  Problem  behavior  was 
recorded  when  participants  engaged  in  self-injury  (e.g.,  head  hitting,  skin  picking), 
aggression  (hitting,  kicking,  scratching,  biting  therapist),  and  disruptive  behavior 
(throwing  or  attempting  to  destroy  materials).  Requests,  attempts  to  leave,  negative 
vocalizations,  and  problem  behavior  were  rarely  observed  with  the  participants. 

Therefore,  data  on  requests  will  be  reported  for  Kal  only;  data  on  the  remaining  behaviors 
are  not  reported  for  any  of  the  participants  because  these  responses  were  observed 
extremely  infrequently. 

Trained  observers  collected  data  using  laptop  computers.  Interobserver 
agreement  was  assessed  by  having  a second  observer  simultaneously  but  independently 
record  data  during  a minimum  of  30%  of  sessions  for  each  participant  (range,  30.3%  to 
52%).  Agreement  percentages  were  calculated  by  partitioning  session  time  into  10  s 
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intervals  and  comparing  observers’  records  on  an  interval-by-interval  basis.  The  smaller 
number  of  responses  in  each  interval  was  divided  by  the  larger  number  of  responses; 
these  fractions  were  then  averaged  across  intervals  and  multiplied  by  1 00%  to  obtain  a 
percentage  agreement  score.  Mean  percentage  agreement  across  participants  was  95.6% 
(range,  65.3%  to  100%)  for  the  target  behavior,  and  94.6%  (range,  64%  to  100%)  for 
reinforcer  delivery.  Data  on  requests  are  presented  only  for  Kal,  and  the  percentage 
agreement  for  requests  was  91.0%  (range,  69.6%  to  100%). 

Reinforcer  Selection 

Prior  to  the  start  of  the  study,  preference  assessments,  based  on  procedures 
described  by  DeLeon  and  Iwata  (1996),  were  conducted  with  each  participant.  Seven 
items  were  placed  in  an  array  in  front  of  the  participant,  who  was  then  allowed  to  select 
and  consume  one  item.  The  selected  item  was  then  removed  from  the  array  and  the 
participant  was  allowed  to  select  from  the  remaining  items.  This  process  continued  until 
all  items  were  selected  or  30  s passed  with  no  selection.  This  assessment  was  repeated  a 
minimum  of  five  times.  Selection  percentages  were  then  calculated  for  each  item  by 
dividing  the  number  of  times  that  the  item  was  selected  by  the  number  of  trials  on  which 
the  item  was  available.  Assessments  involved  the  presentation  of  edible  items  for  all 
participants  except  for  Peg,  who  participated  in  leisure  activity  assessment  due  to  dietary 
restrictions.  The  five  items  or  activities  (sewing,  painting,  magazines,  art,  and  stickers) 
identified  as  most  highly  preferred  for  Peg  were  chosen  for  use  in  the  study.  For  all  other 
participants,  an  item  was  chosen  from  among  the  items  ranked  in  the  top  three.  Stimuli 
selected  for  each  participant  were  mint  patties  (Kal),  pretzels  (Amy),  candy  coated 
chocolate  pieces  (Sven,  Phyllis,  Larry),  pieces  of  fig  bar  (Julie)  and  cereal  (Lynn).  Brad 
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participated  in  both  the  DRO  versus  EXT  and  the  NCR  versus  EXT  comparisons.  The 
preferred  items  selected  for  use  in  these  analyses  were  jelly  beans  and  candy  coated 
chocolate  pieces,  respectively. 


Procedures 

To  ensure  that  the  target  response  came  into  contact  with  the  contingencies 
associated  with  each  experimental  condition,  two  physical  prompts  to  perform  the 
response  were  delivered  prior  to  each  session.  The  designated  consequences  for 
responding  were  delivered  following  the  prompted  response.  Specifically,  prior  to 
sessions  in  the  baseline,  extinction,  NCR,  and  DRO  conditions,  the  therapist  physically 
prompted  the  target  response  and  then  provided  no  consequences.  Prior  to  sessions  in  the 
reinforcement  condition,  the  therapist  physically  prompted  the  target  response  and  then 
delivered  the  reinforcer. 

Baseline.  During  baseline  sessions,  task  materials  were  present,  and  the  target 
response  resulted  in  no  programmed  consequences. 

Reinforcement.  During  reinforcement  sessions,  a small  piece  of  the  edible  item  or 
1 token  was  delivered  contingent  upon  the  target  response  according  to  a fixed  ratio- 1 
(FR-1)  schedule.  Additional  prompts  to  perform  the  target  response  were  provided 
during  initial  FR-1  sessions  conducted  with  Sven  and  Phyllis.  Verbal  and  model  prompts 
were  provided  every  30  s if  these  participants  had  not  engaged  in  the  target  response 
within  the  last  30  s.  Prompts  were  discontinued  after  5 and  3 sessions  for  Sven  and 
Phyllis,  respectively.  No  prompts  were  delivered  during  subsequent  FR-1  sessions. 

Extinction.  During  extinction  sessions,  task  materials  were  present,  and  the  target 
response  resulted  in  no  programmed  consequences  (the  edible  items  or  tokens  were 
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absent).  An  extinction-with-food  condition  was  implemented  with  Lynn  only.  In  this 
condition,  Lynn’s  preferred  edible  item  was  present;  however,  no  consequences  were 
delivered  contingent  upon  performance  of  the  target  response.  Peg  was  allowed  to 
exchange  tokens  for  access  to  preferred  items  or  activities  immediately  following  FR- 1 
and  DRO  sessions. 

PRO.  During  DRO  sessions,  the  edible  item  or  token  was  delivered  contingent 
upon  the  absence  of  the  target  response  during  intervals  of  a specified  length.  The  length 
of  the  DRO  interval  was  equal  to  the  mean  inter-response  time  (IRT)  from  the  last  4 
sessions  of  the  previous  reinforcement  condition  (rounded  to  the  nearest  second).  The 
DRO  interval  was  reset  each  time  a target  response  occurred. 

NCR.  During  NCR  sessions,  the  edible  item  or  token  was  delivered  according  to 
a fixed-time  (FT)  schedule.  The  FT  schedule  was  equal  to  the  mean  inter-response  time 
(IRT)  from  the  last  4 sessions  of  the  previous  reinforcement  condition  (rounded  to  the 
nearest  second). 

Experimental  Design 

Control  conditions  were  compared  with  the  reinforcement  condition  in  reversal 
designs  for  all  participants.  Comparisons  between  control  conditions  were  made  using 
both  multielement  and  reversal  designs  for  each  participant.  Each  of  the  experimental 
conditions  was  continued  until  researchers  determined,  using  visual  inspection,  that  4 
consecutive  stable  points  were  obtained,  with  no  apparent  trends  in  the  data.  When  two 
conditions  were  compared  in  a multielement  design,  both  conditions  were  conducted  until 
stability  was  achieved  in  both  conditions. 


RESULTS 


PRO  versus  Extinction  Comparisons 

The  top  panel  of  Figure  1 shows  responses  per  minute  of  the  target  response 
during  the  DRO  and  EXT  comparison  for  Kal.  He  did  not  perform  the  target  response 
during  baseline;  however,  when  reinforcement  was  delivered  according  to  an  FR-1 
schedule,  his  responding  increased  gradually  and  maintained  at  a high  rate.  During  the 
multielement  comparison  of  the  DRO  and  EXT  control  conditions,  EXT  resulted  in  an 
immediate  reduction  in  the  target  response  to  near-zero  levels.  This  low  rate  of 
responding  was  maintained  throughout  the  multielement  comparison.  DRO  also  resulted 
in  an  immediate,  but  smaller,  decrease  in  the  target  response  and  a gradual  reduction  to 
near-zero  levels.  Following  this  multielement  comparison,  high  rates  of  the  target 
response  were  recovered  in  an  FR-1  phase  that  was  followed  by  a comparison  of  DRO 
and  EXT  in  a reversal  design.  EXT  again  resulted  in  an  immediate  reduction  in  the  target 
response  to  zero  levels,  and  responding  recovered  immediately  in  the  subsequent  FR-1 
condition.  As  in  the  multielement  comparison,  DRO  resulted  in  an  immediate,  but 
smaller,  decrease  in  the  target  response  followed  by  a gradual  decrease  in  responding  to 
near  zero  levels.  Responding  recovered  quickly  in  the  subsequent  FR-1  condition. 

The  bottom  panel  of  Figure  1 shows  the  responses  per  minute  of  requests 
displayed  by  Kal  during  the  various  experimental  conditions.  Requests  were  recorded 
when  Kal  pointed  to  his  mouth  and  said  “eee,”  moved  his  plate  toward  the  therapist,  or 
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attempted  to  reach  for  the  edible.  These  responses  rarely  occurred  during  the  baseline, 
FR-1  and  EXT  conditions;  however,  Kal  engaged  in  these  responses  at  high  rates  during 
the  DRO  condition.  Requests  generally  increased  during  the  first  few  sessions  of  both 
DRO  phases,  then  gradually  decreased  to  near-zero  levels  as  the  phase  continued.  It 
appeared  that  these  responses  were  accidentally  reinforced  during  initial  sessions  of  the 
DRO  phases;  however,  requests  eventually  decreased  presumably  due  to  the  absence  of  a 
programmed  contingency  of  reinforcement  for  these  responses. 


BL  FR-1  DRO  vs  EXT  FR-1  EXT  FR-1  DRO  FR-1 


Sessions 

Figure  1 : Responses  per  minute  of  the  target  response  (top  panel)  and  requests  (bottom 
panel)  for  the  DRO  and  EXT  comparison  with  Kal. 
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Figure  2 shows  the  results  obtained  with  Amy.  Amy  engaged  in  the  target 
response  at  a very  low  rate  during  baseline  and  at  a high  rate  in  the  FR-1  condition.  In 
the  initial  sessions  of  the  multielement  comparison,  responding  decreased  to  a very  low 
rate  in  both  the  DRO  and  EXT  conditions.  Responding  was  maintained  at  near- zero 
levels  in  the  DRO  condition  throughout  the  phase;  however,  some  recovery  of  responding 
was  observed  in  the  EXT  condition,  followed  by  a decrease  in  responding  to  near-zero 
levels.  Responding  increased  to  a higher  rate  during  the  subsequent  FR-1  phase.  DRO 
and  EXT  were  then  compared  in  a reversal  design.  During  this  comparison,  DRO 
resulted  in  an  immediate  reduction  in  responding  to  near  zero,  and  responding  recovered 
quickly  in  the  subsequent  FR- 1 condition.  The  introduction  of  EXT  resulted  in  an  initial 


BL  FR-1  DRO  vs  EXT  FR-1  DRO  FR-1  EXT  FR-1 


Figure  2:  Responses  per  minute  of  the  target  response  during  the  DRO  and  EXT 
comparison  with  Amy. 


increase  in  responding,  followed  by  a gradual  reduction  to  near-zero  levels.  Responding 
recovered  quickly  in  the  subsequent  FR-1  phase. 
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Figure  3 shows  the  results  for  Sven.  Sven  never  performed  the  target  response 
during  baseline.  His  responding  gradually  increased  to  high  rates  during  the  first  FR-1 
phase.  During  the  multielement  comparison  of  DRO  and  EXT,  responding  decreased  to 
low  rates  in  the  EXT  condition;  however,  near-zero  levels  of  the  response  were  not 
achieved.  Responding  in  the  DRO  condition  was  variable  and  sometimes  high.  In  fact, 
the  highest  rate  of  responding  displayed  by  Sven  during  the  experiment  was  observed  in 
the  first  session  of  the  DRO  condition.  As  a result  of  this  high  rate  of  responding,  rates  of 
reinforcer  delivery  were  relatively  low  during  the  DRO  condition  (3.1  per  min)  as 
compared  to  those  during  the  last  4 sessions  of  the  previous  FR-1  condition  (5.6  per  min). 


00 


Sessions 

Figure  3:  Responses  per  minute  of  the  target  response  during  the  DRO  and  EXT 
comparison  with  Sven. 

Responding  was  high  in  the  subsequent  FR-1  condition.  When  DRO  and  EXT  were 
compared  in  a reversal  design,  EXT  resulted  in  a substantial  decrease  in  responding 
followed  by  rapid  recovery  of  the  response  in  the  subsequent  FR-1  condition.  DRO  also 
resulted  in  a substantial  decrease  in  responding;  overall,  however,  more  responding 
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occurred  in  the  DRO  condition  than  was  observed  in  the  EXT  condition.  High  rates  of 
responding  were  not  recovered  until  the  third  session  of  the  subsequent  FR-1  condition. 

Figure  4 shows  the  results  for  the  DRO  and  EXT  comparison  for  Brad.  Brad 
never  performed  the  target  response  during  baseline  or  EXT  sessions,  and  performed  the 
target  response  at  high  and  stable  rates  during  all  sessions  in  the  FR-1  condition. 
Responding  occurred  during  only  the  first  session  of  each  DRO  phase. 


Figure  4:  Responses  per  minute  of  the  target  response  during  the  DRO  and  EXT 
comparison  with  Brad. 

Figure  5 shows  the  results  for  Phyllis,  who  engaged  in  the  target  response  at  very 
low  rates  during  baseline.  The  introduction  of  reinforcement  for  the  target  response 
resulted  in  a substantial  increase  in  the  response.  When  DRO  and  EXT  were  compared  in 
the  multielement  design,  both  reduced  the  target  response  to  low  levels  quickly;  however, 
fewer  responses  were  observed  in  the  EXT  condition.  Responding  recovered  slowly  after 
the  multielement  comparison  phase,  but  relatively  stable  and  high  rates  of  behavior  were 
obtained.  When  EXT  was  introduced,  the  target  response  decreased  to  near-zero  after 
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four  sessions.  Behavior  recovered  quickly  during  the  subsequent  FR-1  condition.  When 
DRO  was  introduced,  behavior  remained  variable  for  approximately  8 sessions  before 
decreasing  to  near- zero  levels.  Behavior  recovered  quickly  during  the  subsequent  FR-1 
condition. 


Sessions 


Figure  5:  Responses  per  minute  of  the  target  response  during  the  DRO  and  EXT 
comparison  with  Phyllis. 


NCR  versus  Extinction  Comparisons 

Julie’s  responding  during  the  NCR  and  EXT  evaluation  is  depicted  in  Figure  6. 
Julie  rarely  performed  the  target  response  during  baseline,  and  performed  the  target 
response  at  high  rates  when  reinforcement  was  introduced.  During  the  multielement 
comparison  of  NCR  and  EXT,  the  target  response  was  immediately  reduced  to  near  zero 
in  the  EXT  condition.  NCR  also  produced  a decrease  in  responding;  however, 
responding  maintained  at  moderate  rates  in  the  NCR  condition  throughout  the 
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multielement  phase.  After  recovering  high  rates  of  responding  in  the  FR- 1 condition, 
EXT  was  introduced.  Again,  EXT  resulted  in  an  immediate  decrease  in  responding  to 
near  zero,  and  behavior  recovered  quickly  during  the  subsequent  FR-1  condition.  NCR 
produced  a decrease  in  responding;  however,  moderate  rates  of  the  response  were 


maintained.  Behavior  recovered  relatively  quickly  in  the  subsequent  FR-1  condition. 
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Figure  6:  Responses  per  minute  of  the  target  response  during  the  NCR  and  EXT 
comparison  with  Julie. 

Peg’s  results  are  depicted  in  the  top  panel  of  Figure  7.  She  did  not  engage  in  the 
target  response  during  baseline  and  she  displayed  high  rates  of  the  target  response  when 
reinforcement  was  introduced.  When  NCR  and  EXT  were  compared  in  a multielement 
design,  neither  condition  produced  a consistent  decrease  in  responding.  When  EXT  was 
evaluated  in  the  reversal  design,  behavior  remained  variable  for  1 2 sessions  before 
decreasing  to  zero.  High  rates  of  the  target  response  were  recovered  quickly  during  the 
subsequent  FR-1  phase.  When  NCR  was  introduced,  behavior  remained  variable  for  13 
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Figure  7:  Responses  per  minute  of  the  target  response  during  the  NCR  and  EXT 
comparison  with  Peg  (top  panel)  and  Larry  (bottom  panel). 


sessions  before  decreasing  to  zero.  High  and  stable  rates  of  the  target  response  were 
recovered  relatively  slowly  after  exposure  to  the  NCR  condition. 
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The  bottom  panel  of  Figure  7 shows  the  results  obtained  with  Larry.  Larry  never 
performed  the  target  response  during  baseline,  NCR,  or  EXT  conditions;  however,  he 
engaged  in  the  target  response  at  high  and  consistent  rates  during  each  exposure  to  the 
FR-1  condition. 

Lynn’s  results  are  depicted  in  Figure  8.  Lynn  did  not  perform  the  target  response 
during  baseline,  and  her  responding  increased  to  high  and  stable  rates  during  the  FR-1 
condition.  When  NCR  and  EXT  were  compared  in  a multielement  design,  EXT  reduced 
her  responding  to  zero  after  5 sessions;  however,  her  responding  persisted  at  high  and 
stable  rates  in  the  NCR  condition.  Responding  recovered  after  4 sessions  during  the 
subsequent  FR-1  phase.  When  EXT  was  implemented  in  the  reversal  design,  responding 
was  reduced  to  zero  by  the  second  session,  and  the  target  response  recovered  immediately 
during  the  subsequent  FR-1  phase.  When  NCR  was  implemented,  responding  again  was 
maintained  at  high  and  stable  levels.  In  an  attempt  to  identify  factors  contributing  to 
response  maintenance  under  NCR,  we  conducted  some  additional  manipulations.  The 
first  manipulation  was  conducted  in  order  to  determine  whether  responding  had  been 
accidentally  reinforced  during  the  NCR  control  condition  due  to  unprogrammed 
contiguity  between  the  target  response  and  the  delivery  of  reinforcement.  A DRO 
condition  was  implemented  in  which  the  reinforcer  could  not  be  delivered  contiguous 
with  responding  because  reinforcement  was  delivered  contingent  upon  the  absence  of 
responding.  The  DRO  interval  was  equal  to  the  FT  schedule  according  to  which 
reinforcement  was  presented  during  the  NCR  condition.  Responding  persisted  at  high 
levels  in  the  DRO  condition,  and,  as  a result,  the  rate  of  reinforcer  delivery  during  the 
DRO  condition  was  extremely  low  (0.6  per  min)  as  compared  to  reinforcement  rates 
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Figure  8:  Responses  per  minute  of  the  target  response  during  the  NCR  and  EXT 
comparison  with  Lynn. 

during  the  previous  FR-1  (7.2  per  min)  and  NCR  (7.3  per  min)  conditions.  Persistence  of 
responding  in  the  DRO  condition  provided  compelling  evidence  against  the  role  of 
accidental  reinforcement  in  the  maintenance  of  responding  during  NCR. 

A second  manipulation  was  implemented  to  determine  whether  responding  was 
under  stimulus  control  of  the  presence  of  the  reinforcer.  Responding  occurred  at  low 
rates  during  conditions  in  which  the  reinforcer  was  absent  (baseline  and  EXT)  conditions, 
but  at  high  rates  during  conditions  in  which  the  reinforcer  was  present  (FR-1,  NCR,  and 
DRO).  Therefore,  it  was  possible  that  the  mere  presence  of  the  reinforcer  was  sufficient 
to  occasion  responding.  To  evaluate  this  possibility,  we  conducted  an  EXT  w/  food 
condition  in  which  the  reinforcer  was  in  view,  but  was  not  delivered  following 
performance  of  the  target  response.  Lynn  never  emitted  the  target  response  during  this 
condition,  suggesting  that  her  responding  was  not  occasioned  by  the  mere  presence  of  the 
reinforcer.  These  results  led  to  the  conclusion  that  Lynn’s  responding  had  come  under 
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stimulus  control  of  reinforcer  delivery  rather  than  the  presence  of  the  reinforcer  per  se. 
Finally,  her  responding  recovered  slowly  during  the  last  FR-1  condition. 

Brad’s  results  are  depicted  in  Figure  9.  Brad  never  performed  the  target  response 
during  baseline  or  EXT  sessions,  and  he  displayed  high  and  consistent  rates  of 
responding  during  all  FR-1  sessions.  When  NCR  was  presented  in  the  multielement 
comparison,  responding  decreased,  but  maintained  at  moderate  rates.  When  NCR  was 
evaluated  in  the  reversal  design,  responding  maintained  at  high  but  variable  rates. 
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Figure  9:  Responses  per  minute  of  the  target  response  during  the  NCR  and  EXT 
comparison  with  Brad. 


A summary  of  the  results  from  the  DRO  versus  EXT  and  NCR  versus  EXT 
comparisons  is  provided  in  Table  1 . EXT  was  the  only  control  condition  that  resulted  in 
a substantial  reduction  in  the  target  response  in  all  cases.  EXT  also  produced  the  most 
rapid  reduction  in  the  target  response  in  most  (8/10)  cases.  A response  burst  occurred 
during  EXT  with  one  participant  and  during  DRO  with  one  participant;  response  bursts 
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were  never  observed  during  NCR.  Finally,  the  target  response  was  recovered  quickly 
following  EXT  in  all  cases;  recovery  occurred  more  slowly  following  NCR  in  one  case, 
and  DRO  in  one  case. 


Table  2 

Summary  of  Results  from  the  DRO  versus  EXT  and  NCR  versus  EXT  Comparisons  with 
all  Participants. 


Condition 

Effective 

Most  rapid 

No  burst 

Fast  recovery 

EXT 

10/10 

8/10 

9/10 

10/10 

NCR 

3/5 

0/5 

5/5 

3/5 

DRO 

5/6 

1/6 

5/6 

4/5 

DISCUSSION 


We  compared  a traditional  extinction  control  condition  with  NCR  and  DRO 
controls  and  found,  in  general,  that  extinction  produced  the  most  rapid  reversal  effects. 

In  every  case,  the  extinction  condition  produced  a substantial  reduction  in  the  target 
response,  and  in  8/10  cases  the  most  rapid  decrement  was  achieved  with  extinction. 
These  results  are  consistent  with  those  of  some  previous  studies  (e.g.,  Rescorla  & Skucy, 
1969;  Uhl  & Garcia,  1969)  that  compared  EXT  with  NCR  and/or  DRO.  Given  these 
findings,  it  is  not  surprising  that  the  traditional  extinction  condition  is  the  control  most 
commonly  used  in  applied  research. 

The  NCR  control  condition  was  evaluated  with  5 participants;  in  two  of  these 
cases  (Lynn  and  Brad),  NCR  failed  to  produce  a substantial  reduction  in  the  target 
response.  For  one  additional  participant,  NCR  resulted  in  a small  but  consistent  decrease 
in  the  target  response.  These  results  are  consistent  with  previous  research  that  has 
demonstrated  response  maintenance  during  NCR  conditions  (e.g.,  Konarski,  Johnson, 
Crowell,  & Whitman,  1980;  Osborne,  1969).  The  persistence  of  behavior  in  NCR 
conditions  has  often  been  attributed  to  “accidental  reinforcement,”  which  may  occur 
when  stimuli  are  delivered  contiguous  with,  although  not  contingent  upon,  responding. 

It  is  possible  that  response  maintenance  observed  with  some  participants  during 
NCR  was  a result  of  accidental  reinforcement.  The  FT  schedules  used  in  the  NCR 
condition  were  based  on  the  rate  of  reinforcement  during  the  FR-1  condition.  Therefore, 
if  responding  occurred  during  NCR  conditions  at  rates  similar  to  those  observed  during 
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the  FR-1  condition,  the  probability  of  accidental  contiguity  between  the  target  response 
and  reinforcer  delivery  was  very  high.  Thus,  it  is  possible  that  the  probability  of 
accidental  reinforcement  was  elevated  because  reinforcement  rate  was  equated  across 
experimental  and  NCR  control  conditions. 

In  order  to  evaluate  the  role  of  accidental  reinforcement  in  response  maintenance 
observed  during  NCR  with  Lynn,  we  eliminated  the  possibility  of  accidental  contiguity 
by  conducting  a DRO  condition.  Surprisingly,  we  observed  that  Lynn’s  behavior  also 
persisted  in  the  DRO  condition,  suggesting  that  response  maintenance  was  not  a result  of 
accidental  reinforcement.  An  alternative  explanation  for  the  persistence  of  responding  in 
the  NCR  and  DRO  conditions  was  that  the  presence  of  the  reinforcer  (pieces  of  cereal) 
exerted  stimulus  control  over  responding.  This  interpretation  would  account  for  the  low 
levels  of  behavior  observed  in  the  extinction  condition  in  which  the  reinforcer  was 
absent,  and  maintenance  of  responding  in  the  NCR  and  DRO  conditions  in  which  the 
reinforcer  was  present. 

To  determine  whether  the  presence  of  food  exerted  stimulus  control  over  Lynn’s 
responding,  we  conducted  an  extinction  with  food  condition  in  which  the  reinforcer  was 
present  but  was  not  delivered  at  any  time  during  the  session.  Lynn  never  engaged  in  the 
target  response  during  this  condition,  suggesting  that  the  mere  presence  of  the  reinforcer 
was  not  sufficient  to  occasion  responding.  These  results  led  to  the  conclusion  that  the 
delivery  (not  the  mere  presence)  of  the  reinforcer  served  as  a discriminative  stimulus  that 
was  sufficient  to  occasion  responding. 

Results  obtained  with  Lynn  are  consistent  with  results  of  the  Uhl  and  Garcia 
(1969)  study  that  examined  factors  contributing  to  the  persistence  of  responding  during 
DRO.  The  authors  attributed  this  phenomenon  to  the  production  of  a behavioral  chain 
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during  contingent  reinforcement  conditions.  Specifically,  they  suggested  that,  during 
contingent  reinforcement  conditions,  the  consumption  of  reinforcement  becomes 
discriminative  for  further  responding.  Thus,  the  subsequent  delivery  of  reinforcers 
during  DRO  conditions  occasions  responding,  resulting  in  the  retardation  of  response 
decrement.  In  order  to  evaluate  the  discriminative  effect  of  reinforcer  delivery,  the 
authors  eliminated  the  opportunity  to  respond  immediately  after  a reinforcer  delivery  (by 
retracting  the  lever)  and  found  that  this  procedure  eliminated  the  differences  in  speed  of 
response  decrement  observed  when  DRO  and  extinction  were  compared  previously. 

Koegel  and  Rincover  (1977)  also  found  evidence  of  the  discriminative  properties 
of  reinforcer  delivery.  During  the  training  phase  of  this  study,  children  received  edible 
reinforcers  and  praise  for  following  a model  and/or  verbal  instruction  to  complete  a 
simple  task  (e.g.,  raise  your  arm,  touch  your  nose).  Two  of  the  children  then  were 
exposed  to  a condition  in  which  reinforcers  were  delivered  after  10  consecutive  incorrect 
trials  (i.e.,  DRO).  Results  showed  that  correct  responses  increased  substantially 
immediately  after  the  initial  reinforcer  deliveries;  however,  this  effect  diminished  across 
repeated  reinforcer  deliveries.  Taken  together,  the  results  of  these  studies  suggest  that  the 
presentation  of  reinforcers  may  exert  stimulus  control  over  responding.  Furthermore, 
some  cases  of  response  persistence  during  NCR  that  have  previously  been  attributed  to 
accidental  reinforcement  may  be  a result  of  stimulus  control  by  reinforcer  presentation. 

Although  data  from  the  extinction  with  food  condition  conducted  with  Lynn 
indicated  that  the  presence  of  food  did  not  occasion  her  responding,  it  appears  that  the 
presence/absence  of  food  exerted  some  control  over  the  responding  of  other  participants. 
For  example,  during  the  Ext  and  DRO  comparison.  Brad  never  performed  the  target 
response  in  any  of  the  extinction  sessions,  whereas  he  engaged  in  the  target  response  in 
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the  first  DRO  session  of  each  phase.  In  addition,  at  least  some  of  the  responses  in  these 
initial  DRO  sessions  occurred  before  the  first  reinforcer  delivery,  suggesting  that  the 
presence  of  the  food  (not  the  delivery  of  food)  was  sufficient  to  occasion  responding.  It 
is  important  to  note  that  any  differences  in  responding  between  control  conditions  cannot 
be  attributed  to  pre-session  prompts  because  these  prompts  were  identical  across  the  three 
control  conditions.  That  is,  prior  to  each  EXT,  DRO,  and  NCR  sessions  the  therapist 
physically  guided  the  participant  to  perform  the  target  response  and  provided  no 
consequence. 

Pre-session  prompts  were  included  to  ensure  that  the  participants’  behavior  came 
into  contact  with  the  consequence  for  responding  prior  to  each  session  and  apparently 
affected  Larry’s  performance  dramatically.  Because  he  never  performed  the  target 
response  during  EXT  or  NCR  sessions,  zero  levels  of  responding  obtained  in  these 
conditions  can  only  be  attributed  to  exposure  to  contingencies  during  pre-session 
prompts.  Judging  from  this  performance,  it  seems  likely  that  Larry’s  responding  would 
have  decreased  rapidly  in  both  conditions  (EXT  and  NCR)  even  if  pre-session  prompts 
were  not  delivered,  because  two  unreinforced  responses  (during  pre-session  prompting) 
were  sufficient  to  eliminate  responding. 

We  examined  data  from  FR-1  phases  that  were  presented  following  each  of  the 
three  control  conditions  during  the  reversal  comparison  and  found  that  high  levels  of  the 
target  response  were  recovered  within  the  first  session  after  exposure  to  extinction  in  all 
1 0 cases.  More  delayed  recovery  of  the  target  response  was  observed  following  exposure 
to  NCR  with  2 participants  (Peg  and  Julie),  and  DRO  with  1 participant  (Sven).  Thus, 
with  respect  to  speed  of  recovery  following  exposure  to  the  control  conditions,  the 
traditional  extinction  condition  was  again  found  to  be  most  efficient.  Given  the  similarity 
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in  stimulus  conditions  between  NCR  or  DRO  and  FR-1  conditions  (i.e.,  the  delivery  of 
food),  it  is  not  surprising  that  more  extended  exposure  to  the  contingencies  would  be 
required  in  some  cases  to  produce  behavior  change  following  transitions  among  these 
conditions. 

The  introduction  of  extinction  resulted  in  a response  burst  in  only  1 case  (Amy) 
out  of  10.  These  results  , along  with  those  reported  by  Lerman  and  Iwata  (1995)  and 
Lerman,  Iwata,  and  Wallace  (1999),  suggest  that  extinction  bursts  may  be  less  common 
than  previously  assumed.  A response  burst  was  also  observed  in  the  DRO  condition  with 
one  participant.  Sven  displayed  his  highest  levels  of  responding  in  the  first  DRO  session 
of  the  multielement  comparison  between  DRO  and  EXT.  However,  this  session  might  be 
considered  functionally  equivalent  to  an  extinction  session  because  high  rates  of 
responding  resulted  in  a very  low  rate  of  reinforcement  (0.7  per  min)  according  to  the 
DRO  schedule.  By  contrast,  a high  rate  of  reinforcement  was  delivered  in  all  NCR 
sessions,  and  response  bursts  were  never  observed.  High  rates  of  reinforcement  delivered 
during  NCR  may  have  eliminated  the  establishing  operation,  thus  decreasing  the 
probability  of  response  bursts  (Vollmer  et  al.,  1998).  Although  response  bursts  are 
generally  considered  a negative  side  effect,  this  phenomenon  may  be  less  problematic 
when  the  target  response  is  an  appropriate  (e.g.,  microswitch  pressing)  rather  than  an 
inappropriate  response  (e.g.,  self-injury). 

In  addition  to  evaluating  the  direct  effects  of  control  conditions  on  the  target 
response,  we  were  also  interested  in  determining  the  extent  to  which  each  of  these 
conditions  produced  changes  in  non-target  responses.  Therefore,  we  recorded  a number 
of  categories  of  behavior  for  each.  However,  negative  side  effects  such  as  negative 
vocalizations,  attempts  to  leave  the  session,  and  problem  behavior  were  rarely  observed 
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with  any  of  the  participants.  Participants  engaged  in  negative  behaviors  so  infrequently 
that  these  responses  did  not  appear  to  be  systematically  related  to  any  of  the  control 
conditions. 

Although  researchers  have  supposed  that  non-target  responses  might  increase 
with  DRO  schedules,  few  studies  have  actually  recorded  “other”  responses  when 
evaluating  DRO  (Poling  & Ryan,  1 982).  One  notable  exception  is  a study  by  Zeiler 
(1970),  in  which  six  preschool  children  were  taught  to  press  two  concurrently  available 
levers  to  produce  pieces  of  candy.  After  this  history  of  reinforcement  was  established,  a 
DRO  contingency  was  arranged  for  responses  on  the  right  lever,  and  responses  on  the  left 
lever  had  no  effect  on  the  DRO  schedule.  Results  showed  that  4 of  the  6 children 
continued  to  respond  on  the  left  lever  at  very  high  rates,  suggesting  that,  in  some  cases, 
“other”  responses  may  be  reinforced  when  DRO  is  implemented. 

We  attempted  to  evaluate  the  effects  of  DRO  on  non-target  responses  and 
observed  an  interesting  effect  with  one  participant.  Kal  displayed  requests  for  the 
reinforcer  (e.g.,  pointing  to  the  reinforcer,  signing  eat,  moving  his  plate  toward  the 
therapist)  at  high  rates  during  many  of  the  DRO  sessions,  and  rarely  engaged  in  these 
behaviors  during  the  other  experimental  conditions.  These  responses  increased  during 
the  beginning  of  each  DRO  phase  and  gradually  decreased  across  DRO  sessions. 
Examination  of  the  data  streams  from  sessions  in  which  requests  for  the  reinforcer  were 
high  showed  that  requests  were  immediately  followed  by  the  presentation  of  the 
reinforcer  on  many  occasions.  Thus,  it  is  possible  that  requests  for  the  reinforcer  were 
accidentally  reinforced  during  DRO.  However,  given  that  there  was  no  programmed 
contingency  for  these  responses,  it  is  not  surprising  that  this  effect  was  temporary. 
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Although  the  main  goal  of  this  study  was  to  address  practical  concerns  related  to 
the  use  of  EXT,  NCR  and  DRO  as  control  conditions,  data  on  the  rate  of  reinforcer 
delivery  during  each  of  the  conditions  also  allows  us  to  evaluate  these  procedures  from  a 
methodological  standpoint.  In  order  to  isolate  a reinforcement  contingency  from  the 
effects  of  mere  stimulus  presentation,  one  must  include  a control  condition  in  which  the 
reinforcing  stimulus  is  presented,  but  the  contingency  is  eliminated.  Further,  the  most 
convincing  demonstration  of  the  effects  of  the  contingency  can  be  achieved  when  the  rate 
of  stimulus  presentation  is  equated  across  experimental  and  control  conditions. 

In  this  study,  we  attempted  to  equate  the  rate  of  reinforcer  delivery  across 
experimental  and  control  conditions  by  basing  the  NCR  and  DRO  schedules  on  the  inter- 
reinforcement time  from  the  previous  FR-1  condition.  Therefore,  the  reinforcing 
stimulus  was  programmed  to  be  presented  at  approximately  the  same  rate  during  the  FR- 
1,  NCR,  and  DRO  conditions.  During  NCR  sessions,  stimuli  were  presented  according  to 
an  FT  schedule;  therefore,  actual  rates  of  reinforcer  delivery  were  equal  to  programmed 
rates.  However,  because  an  occurrence  of  the  target  response  delayed  the  delivery  of 
reinforcement  during  DRO,  the  actual  rate  of  reinforcer  delivery  was  sometimes 
substantially  lower  than  programmed  rates  of  delivery.  Thus,  a limitation  of  the  DRO 
control  is  an  inability  equate  rate  of  reinforcer  delivery  across  experimental  and  control 
conditions  because  response  rates  determine  reinforcement  rates. 

To  summarize,  the  traditional  extinction  control  produced  the  most  consistent, 
largest,  and  most  rapid  decreases  in  the  target  response  in  most  cases.  Thus,  although 
NCR  may  be  the  most  methodologically  advantageous  procedure,  the  traditional 
extinction  control  may  be  most  preferred  from  a practical  standpoint.  This  presents  a 
difficulty  for  applied  researchers  who  must  balance  methodological  and  practical 
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considerations.  However,  research  that  clearly  delineates  the  effects  of  various  control 
procedures  should  contribute  to  the  applied  researcher’s  ability  to  effectively  weigh  these 
concerns.  For  example,  if  an  experiment  is  to  take  place  during  a very  limited  time 
period,  the  researcher  may  select  traditional  extinction  over  the  NCR  control.  In  this 
case,  the  researcher  may  choose  to  sacrifice  some  methodological  rigor  to  increase  the 
efficiency  of  the  experiment. 

When  attempting  to  balance  methodological  and  practical  concerns,  it  might  be 
necessary  for  researchers  to  assess  the  feasibility  of  alternative  explanations  before 
conducting  lengthy  controls  to  rule  out  the  influence  of  these  variables.  For  example,  in 
this  study,  the  target  behavior  was  consistently  higher  in  the  FR- 1 condition  as  compared 
with  the  traditional  extinction  control.  One  explanation  for  this  finding  is  that  the 
reinforcement  contingency  arranged  in  the  FR-1  condition  resulted  in  increases  in  the 
target  response.  The  alternative  explanation  is  that  the  mere  presentation  of  stimuli  (e.g., 
pieces  of  candy)  during  the  FR-1  conditions  resulted  in  high  levels  of  the  target  response 
(e.g.,  toe  touching).  However,  most  researchers  would  agree  that  the  presentation  of 
candy  would  be  unlikely  to  produce  high  levels  of  toe  touching  in  the  absence  of  a history 
of  reinforcement  for  the  response;  thus,  one  might  consider  the  alternative  explanation  to 
be  infeasible. 

To  the  extent  that  extraneous  variables  are  considered  unlikely  to  exert  control  of 
responding,  one  might  be  less  concerned  about  designing  control  conditions  to  rule  out 
the  influence  of  these  variables.  However,  it  is  impossible  to  determine  the  influence  of 
specific  variables  without  subjecting  them  to  empirical  evaluations;  therefore,  all 
variables  may  be  considered  as  equally  likely  to  contribute  to  behavioral  effects 
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(Johnston  & Pennypacker  1993).  The  most  conservative  approach  is  to  design 
experiments  that  rule  out  the  influence  of  all  extraneous  variables. 

It  is  important  to  note  that  the  generality  of  these  findings  is  limited  because  target 
responses  were  exposed  to  very  brief  histories  of  reinforcement,  which  was  delivered 
according  to  very  dense  schedules.  For  example,  it  is  possible  that  the  traditional 
extinction  condition  would  have  produced  more  gradual  reductions  in  the  target  response 
and  more  response  bursting  following  a more  lengthy  exposure  to  an  intermittent 
reinforcement  schedule.  However,  given  the  current  results,  it  seems  unlikely  that  the 
NCR  and  DRO  control  conditions  would  have  been  more  effective  than  traditional 
extinction  following  intermittent  reinforcement.  It  seems  more  likely  that  longer 
reinforcement  conditions  or  intermittent  schedules  of  reinforcement  would  have  resulted 
in  more  gradual  decreases  in  the  target  response  across  all  three  control  conditions. 
Nevertheless,  future  research  should  continue  to  evaluate  the  direct  and  indirect  effects  of 
these  control  procedures  across  a wide  variety  of  conditions. 
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